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SYSTEM AND METHOD FOR SMART POLLING 

CROSS REFERENCE TO RELATED APPLICATIONS 
The present application claims priority to US provisional application serial number 
5 60/520,658, which is herein incorporated by reference. 

BACKGROUND OF THE INVENTION 
Image recognition is generally performed by optical character recognition (OCR) > 
processing. An application for such image recognition is in the postal or mail handling arts 

1 0 wherein a destination address is read off of an address face of a mail item. Other applications 
may be envisioned by the skilled artisan. In order to ensure accurate reading or decoding of 
the image by OCR processing, multiple independent OCR processes may run concurrently or 
non-concuirently over a same image. Their respective results may be considered and/or 
compared in an effort to determine the most reliable processing results or decode of the 

1 5 scanned address. 

OCR processing in mail handing applications is a combination of four substantially 
independent processes: address block location, binarization, OCR processing and database 
lookup. In brief, address block location is the location of information on an address face of an 
envelope. Binarization is the transformation of gray-level images into binary. OCR 

20 processing is the mapping and identification of an image as an alpha or numeric character. 

Database look up is the rationalization of a stream of successive characters output by the OCR 
by matching the process results with an elaborate set of relational databases comprising postal 
code, city, street and addressee information that are used to identify a destination. The 
aforementioned processes, when taken together, are used to scan an address face image and 

25 map it, with reasonable certainty, into a sortation decision. For purposes of this application, 
the aforementioned will be referred to simply as OCR process. 

Given the OCR process complexity and the inconsistency of destination addresses, 
results of respective OCR processes vary in regards to accuracy. As such, a system and 
method of comparing and weighting the results of respective OCR processes is necessary in 

30 order to achieve overall results that are within an operable or working level or margin of error. 
Such levels or margins may vary upon application. However, assignment of weight and/or 
comparison level is a matter of statistics which may be applied by known computer means 
across a variety of applications. By voting or polling we can pool multiple independent OCR 
results and thereby the error rate inherent with OCR processes would be reduced. 

CONFIRMATION COPY 
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The general field of improving OCR processes has been addressed in the prior art. 
Figure 1 discloses an arrangement wherein several OCR processes 1-3 are arranged in series 
14. An image 10 is introduced into the first 1, then second 2, and then third 3 OCR process if 
the former processes fail to read and decode the image 10. If the image is effectively read and 
decoded by one of the three OCR processes, a result 12 is yielded. While effective in 
decoding images, this arrangement also maintains an error rate which may be too high for 
many applications. One reason for a high error rate lay in the all or nothing approach to 
image reading and decoding. Here, the image is either decoded by one of the three OCR 
processes or an error occurs. There is no in-between. 

Figure 2 depicts the three OCR processes (1-3) of figure 1 arranged in parallel 20, 
each further being connected to a voter 22. The voter attempts to find a consensus and selects 
among the OCR processes results of the image reading and decoding based on a majority rule. 
At least 2 of the 3 OCR processes must agree in order to decode the destination address for 
the polling to be effective. A problem with this method is the costs involved with operating at 
least three OCR processes, as well as gaining and working with often mutually incompatible 
OCR process internal proprietary processes that make reliability ranking difficult. 

Figure 3 depicts the parallel voter arrangement of figure 2 with two OCR processes. 
This represents a more economical arrangement than the requirement for 3 OCR processes per 
figure 2 or would represent the circumstance where one of the 3 OCR processes was totally 
unable to resolve the subject address The operation is essentially the same as in figure 2, 
however only two as oppose to three OCR processes are used. However, a decision based on 
a majority vote is not possible with only two OCR processes. 

In the prior art, several approaches for discrimination of final most reliable decode are 
given such as selecting which result represents the maximal depth of address decode or using 
datum internal (usually unique between OCR processes and manufacturer proprietary) to the 
respective OCR processes to assign related confidence level and select accordingly between 
contending alternative address decodes. 

Problems remain with the prior art processes, namely, that they remain susceptible to 
fault based on depth of decode caused by directory errors or poor thresholding. Additionally, 
the processes rely upon an all or nothing determination of OCR process performance. Yet 
another prior art solution entails accessing OCR internal processes so as to create a 
confidence level based upon internal performance levels of the OCR processes being 
employed. This solution carries with it the burdens, as above, of additional processing and 
access to often proprietary information associated with the OCR internal processing. 
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Additionally, reliability measures used by various vendors of OCR processes are often 
incompatible. Accordingly, a need exists for a practical polling of OCR processes which 
maximizes the information available to arrive at a best possible and most accurate possible 
result 

5 

SUMMARY OF THE INVENTION 
An advantage of the present invention is to enhance performance of two or more OCR 
processes in regards to reading and decoding an image. This and other objects are achieved 
by reducing the all or nothing approach of prior art solutions to a weighted tabulation of 

1 0 various performance successes of a particular reading and decoding by a particular OCR 

process. Such weight may be known in advance based upon assessment of past OCR process 
performances under similar circumstances and/or such performance data gathered over time. 
Such past performance is made available through appropriately stored data records which are 
accessed and otherwise retrieved upon appropriate OCR process application. Such data 

1 5 records may further be continually updated by using video coding operators to truth randomly 
selected polling decisions and thereby continually confirm and refine a given OCR process' 
relative performance based once again on categories that are nominally self-evident during the 
scanning and OCR process. Because such information is electronically stored, it is available 
to a large number of applications without geographical or language restrictions — the latter 

20 being overcome by standards application. 

The data records relate to an OCR process performance as applied to set events or 
categorizations that are nominally assessable during automatic processing. Such 
categorizations include: letter vs. flat vs. parcel, window envelope with transparency, numeric 
field vs. alpha characters field, character pitch and font, noticeable skew, handprint vs. 

25 machine print, color background, interference background (bleed through), matrix print, 

outward address, inward address, addressee, endorsement reading, and stamp value reading. 
Other considerations may also be used. 

The data records, based upon the aforementioned criteria, are statistically quantified 
so as to provide an OCR process based performance weights. As an example, we can select 

30 the OCR process to accept for the decode based on the statistically measured factors such as 
whether we are reading a flat versus a letter or combine in statistical fashion the respective 
factors of merit for a flat mail having numerics and a window envelope. 

Once determined, the results of that OCR process with respect to the aforementioned 
criteria will be given and the polling choice considered over the results of the other OCR 
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processes. Accordingly, the strong points, i.e. the most successful aspects, of each of a 
plurality of OCR process are polled to arrive at a composite resulting reading and decoding. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 
5 The above and other advantages of the present invention will become clear from the 
specification below and the claims appended thereto when taken in conjunction with the 
drawings wherein: 

Figures 1 to 3 depict prior art processes; 

Figure 4 depicts a performance monitoring of a plurality of OCR processes; 
1 0 Figure 5 depicts numerics performance; 
Figure 6 depicts letters performance; 
Figure 7 depicts flats performance; 

Figure 8 depicts an operation phase wherein a decision is weighted; 
Figure 9 depicts numerics weighting; 
1 5 Figure 10 depicts letters weighting; and 

Figure 1 1 depicts a flowchart of the present method. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention will now be discussed with respect to the above listed figures, starting 

20 with figure 4, wherein like numerals refer to like elements. Figure 4 depicts performance 
monitoring 40 wherein the OCR processes are polled 42 based upon individual results 
according to preset categorizations general to both OCR processes, the data of which is 
provided during manual encoding. The statistical categorizations include the following 
domains: letter vs. flat vs. parcel, window envelope with transparency, numeric field vs. alpha 

25 characters field, character pitch and font, measurable skew, handprint vs. machine print, color 
background, interference background (bleed through), matrix print, outward address, inward 
address, addressee, endorsement, and stamp value. Other considerations my be included as 
envisioned by one skilled in the ait 

Such a statistical categorization can be done by prior testing and be updated and 

30 refined by having encoders truth randomly selected polling events where the OCR processes 
differed. Encoders may receive every, almost every, or other number of unsuccessfully 
decoded images. Additionally, the number and type of categorization may vary upon 
application. Considering a world wide application and a typically numerical answer to such 
categorizations, the language of the categorization is inconsequential and the geographical 
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location of the encoders also equally fluid. Rather an indication of OCR process' 
performance with respect to at least one of the above criteria is sought. For purposes herein it 
will be assumed that (figure 4): an image 42 was fed to the three OCR processes 1-3. 
Although the invention has particular value when a decision needs to be made with only 2 (or 
5 an even number) OCR processes are in contention, the cited examples show 3 OCR processes 
in contention to stress the ease of assimilating multiple OCR processes by virtue of not 
requiring any internal specification or proprietary internal information. 

Figure 4 depicts performance based OCR processing 44. Hence, the OCR processes 
are polled and a decoding selected based upon prior computed statistical weighting per a 

10 categorization such as discussed above. In operation and as will be seen in the subsequent 
figures, once at least a workable amount of data is amassed concerning the individual OCR 
process performance per criteria or categorization, each OCR process may be so weighted for 
the decision process. Additional, resolution and refinement can be accrued by having 
operators truthed via random polling decisions and as dictated by the results update/refine the 

1 5 statistics supporting the categorization. 

By way of example, in figure 5, each OCR process 1-3 includes bar graphs 50, 52, 54, 
whose height represents the respective OCR process performance in successfully reading and 
decoding numerics 56. As depicted, OCR process 2 ranks highest (52), then OCR process 1 
(50), then OCR process 3 (54). In operation, the polling element 42 would consult the 

20 database for the relevant data records (depicted as bar graphs), electronically determine a 

largest value (herein 52) and provide a weighted value to OCR 2. Should the value be within 
acceptable application tolerances (rejecting a null hypothesis with the next closest OCR 
process), the OCR 2 reading and coding of numerics will be assumed correct. This data 
retrieval and evaluation is performed automatically by appropriate electronic means such as a 

25 properly programmed computer. 

Figure 6 depicts the above described process applied to the reading and coding of mail 
items, the mail items comprising, in this example, letters 66. The OCR processes each have a 
ranking 60, 62, 64 for performance of the letters. 

Figure 7, depicts the different OCR process rankings 70, 72, 74 as applied to reading 

30 and coding of flats 76. As may be appreciated, this arrangement applies to all considerations 
common to the OCR processes. 

Figure 8 depicts the decision process 80 which is automatically performed by the 
polling element 42. Other means, appropriately configured to effect the decision process may 
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be used with or in place of the polling. The amount of required data supporting a weight and 
application requirements for appropriate reading and coding vary. 

Figure 9, depicts weighted decisions with respect to numerics 96. As with the above, 
the weighted decision is depicted in bar graph form. The bar graphs of figure 9 (90, 92, 94) 
5 correspond in value to the bar graphs of figure 5 (50, 52, 54) which also dealt with numerics. 
The same relationship may be found between figure 10 (100, 102 and 104) and figure 6 (60, 
62, 64) the both of which deal with letters. 

Known statistical techniques, such as Null Hypotheses Testing may be used to map 
the encoder evaluations to a decision regarding an OCR's weight such that only statistically 
1 0 significant relative differences are reflected in the final polling decision process. 

Figure 1 1 depicts a flowchart of a method according to the step of scanning the image 
with at least two OCR processes 112. The present invention may be used with any number of 
OCR processes. A determination 1 14 is made whether all OCR processes successfully 
decoded the image. If the OCR processes did not successfully decode the image 116, then the 
1 5 method ends 118 and the image would most likely proceed to video coding. 

If the OCR processes successfully read the image 120, another determination 122 is 
made, namely whether the OCR processes produced a substantially same result. If the OCR 
processes produced substantially the same result with sufficient reliability as required by the 
current application 124, the need for polling is obviated and the method ends 118. 
20 If the OCR processes did not produce the substantially same result 123, the method 

continues with polling. Herein, a highest weighted OCR process categorization based 
performance is accepted as a correct decoding 136 and the process ends 118. 

A second polling related step includes manual truthing of randomly selected polling 
decisions so as to further improve the precision of the statistical inference 125. Accordingly, 
25 an operator video codes an image 126 and indicates a correctness of the polling decision and 
the statistics for the related OCR process further incremented or if the polling was in error, the 
related OCR process weights are decremented 128. The method then ends 118. 
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